Tag
1 article
This explainer explores MEM, a multi-scale memory system that extends the context window of Vision-Language-Action (VLA) models to 15 minutes, enabling robots to perform complex, multi-step tasks.